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Abstract 

The application of neural networks in the data 
mining has become wider and wider. Neural networks 
have high acceptance ability for noisy data, high accuracy 
and are preferable in data mining. We focus on the 
data mining process based on neural network. In this 
paper the data mining based on adaptive neural network 
is researched in detail, the key technology and ways to 
achieve the data mining based on neural networks are also 
researched. 
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1. Introduction 


Data mining is a process of finding and interpreting 
potential and useful knowledge or information from large 
quantities of data. In the past decade, the techniques of data 
mining caused the attention of many scholars. It has 
applied in various fields, for example, the finance, retail 
trade, the communication, manufacturing industry, medical 
treatment and government decision [1-3]. However, it 
hardly applied in the design of mechanical structures. The 
results of many research fields such as statistics, pattern 
recognition, artificial intelligence, machine learning, 
database and management information system are applied 
to the data mining [4-6]. In this paper, the back 
propagation (BP) neural network method is used as the 
technique of data mining to analyze the effects of structural 
technologic parameters on stress in the weld region of the 
shield engine rotor in a submarine. 

The machine learning and the statistics are two main 
technical approaches of data mining. The machine 
learning, as a broad subfield of artificial intelligence, is 
concerned with the development of algorithms and 
techniques that allow computers to learn ability to 
achieve the tasks of identifying, inducing, classification, 
predication etc. Artificial Neural Network (ANN) and 
Decision Tree are the most widely applied methods in 
those fields. ANN is an information processing paradigm 
that is inspired by the way biological nervous 
systems, such as the brain, process information. It is 
composed of a large number of highly interconnected 
processing elements (neurons) working in unison to solve 
specific problems. The most typical neural networks are 
the BP neural network, the Hopfield neural networks 
and the adaptive neural networks. 

As the other technical support of data mining, the 
statistics offers the most fundamental theory of data 


mining techniques based on the precise mathematical 
approach. In recent years, agglomerative methods are 
widely applied in data mining. They can take each 
record as a class at the beginning. Then new classes 
should continuously agglomerated used K-mean 
algorithms until only one class could be obtained. They 
can solve many problems such as pattern recognition or 
data classification. Scientists of all fields for its 
developments have interested neural network. Neural 
network is a complex network system which generated 
with simulating the image intuitive thinking of human, 
on the basis of the research of biological neural network, 
according to the features of biological neurons and neural 
network and by simplifying, summarizing and refining. It 
uses the idea of non-linear mapping, the method of 
parallel processing and the structure of the neural 
network itself to express the associated knowledge of 
input and output. Initially, the application of the neural 
network in data mining was not optimistic, and the 
main reasons are that the neural network has the 
defects of “black-box”, we can not understand the 
learning and decision-making process in the network, 
poor interpret ability and long training time. But its 
advantages such as high affordability to the noise data 
and low error rate, the continuously advancing and 
optimization of various network training algorithms, 
especially the continuously advancing and improvement 
of various network pruning algorithms and rules 
extracting algorithm, make the application of the neural 
network in the data mining increasingly favored by the 
overwhelming majority of users. In this paper the data 
mining based on the neural network is researched in 
detail. 

2. Principle of data mining based on 
neural networks 


Neural networks, with their remarkable ability to 
derive meaning from complicated or imprecise data, can 
be used to detect trends that are too complex to be noticed 
by either humans or computer techniques. The 
application of data mining based on neural networks can 
be generally divided into three stages: data preparation, 
modelling and knowledge discovery, as showed in Figure 
I. 

A. Data Preparation 

So-called data preparation is the definition and 
expression of the mined data, which can make the mined 
data suitable for the algorithm. Data preparation is the 
most important step of data mining. It mainly includes 
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two processes, namely data selection and expression. 

Neural network method is used for pattern recognition, 
expert system, classification, feature mining, clustering, 
prediction. Neural network comes from the neurons 
structure theory of animals, bases on the M-P model and 
Hebb learning rule. So in essence it is a distributed matrix 
structure. Through training data mining, the neural 
network method gradually calculates (including repeated 
iteration) the weights the neural network connected. The 
neural network model can be broadly divided into the 
following four types: 

(1) Feed-forward networks: it regards the perception 
back-propagation model and the function network as 
representatives, and mainly used in the areas such as 
prediction and pattern recognition; 

(2) Feedback network: it regards Hopfield discrete 
model and continuous model as representatives, and 
mainly used for associative memory and optimization 
calculation; 

(3) Self-organization networks: it regards adaptive 
resonance theory (ART) model and Kohonen model as 
representatives, and mainly used for cluster analysis. 

(4) Random neural network: it is a special kind of 
artificial neural network, which is developed recently. As 
a biological neural mathematical model, it has advantages 
of associative memory, image processing and 
combinatorial optimization. 

Artificial neural network has the characteristics of 
distributed information storage, parallel processing, 
information, reasoning, and self-organization learning, 
and has the capability of rapid fitting the non-linear data. 
At present, artificial neural network has some difficult 
problems. Aiming at the difficult problems(data 
convergence, stability, local minimum and the training 
parameters adjustment ) some people adopted the method 
of combining artificial neural networks and genetic gene 
algorithms and achieved better results, so it can solve 
many problems which are difficult for other methods to 
solve. 

3. Neural Network based DM 

General data mining process can be composed by three 
main phases: data preparation, data mining, expression 
and interpretation of the results, data mining process is the 
reiteration of the three phases. The details are shown in 
Figure 1. 

The data mining based on neural network process can 
be composed by four main phases: model option, data 
preparing, rules option and result assessment, according 
to the result assessment the process is the reiteration of 
the four phases. The details are shown in Figure 2. 

3.1. NN Model 


Model option is to choice a data mining model based on 
neural work. The types of data mining model based on 
neural network are hundreds, but there are only two types 
most used which are the data mining based on the self¬ 
organization neural network and on the fuzzy neural 
network. The theory of the data mining based on self¬ 
organization neural network is evolutionism theory that is 
inheritance-aberrance-choice-transform. It can develop from 
simpleness to complication by itself. So, self- organization 
process is a process of learning without teachers. 

The fuzzy neural networks frequently used in data 
mining are fuzzy perception model, fuzzy BP network, 
fuzzy clustering Kohonen network, fuzzy inference 
network and fuzzy ART model. Using fuzzy theory and 
neural network to structure and train fuzzy neural network, 
the method can overcome the shortcomings of neural 
network such as complex structure, long training time and 
lack of understandable representation of results. 
Establishment and training of fuzzy neural network which 
meet the precision requests realize the utilization fuzzy 
neural network method to withdraw the knowledge from 
the database. It can not only increase its output expression 
capacity but also the system becomes more stable. 

3.2. Data preparation 

Data preparation is to define and process the mining 
data to make it fit the data mining model which has been 



selected. Data preparation plays a decisive role in the 
entire data mining process. It mainly includes the 
following four processes, and they are data cleaning, data 
option, data preprocessing, data expression. 

Data cleaning: Data cleansing is to eliminate the noise 
data and find out the disrelated. data, eliminate the 
vacancy value of the data, correct the inconsistencies data in 
the data. 

Data option: Data option is to select the data arrange 
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and row used in this mining. 

Data preprocessing: Data preprocessing is to enhanced 
process the clean data which has been selected. 

Data expression: Data expression is very important to a 
mining. Most of the data mining tools can only handle 
numerical data, but it is impossible that only numerical 
data in a database, so it is need to transform the sign data 
into numerical data. According to the character of the sign 
data, they all basically can be simply come down to sign 
data, discrete numerical data and serial numerical data 
three logical data types. Each data type has a appropriate 
method to transform the sign data. To sign data, the 
simplest method is to establish a table with one-to-one 
correspondence between the sign data and the numerical 
data. To discrete numerical data and serial numerical data, 
the more complex approach is to adopt appropriate Hash 
function to generate a unique numerical data according to 
given string. The detail can be seen in reference 14. 

3.3. Rules Selection 

Rule selection is the traditional problem of data 
mining, and is the core problem, too. There are many 
methods to extract rules, in which the most commonly 
used methods are black-box method , extract fuzzy rules 
method , BP network method , the method of extracting 
rules from recursive network, the algorithm of binary 
input and output rules extracting, partial rules extracting 
algorithm (Partial-RE) and full rules extracting algorithm 
(Full-RE). 

3.4. Result assessment 

Different application has different rules of assessment , 
but, in general terms, the result can be assessed in 
accordance with the following objectives [15] . 

(1) Whether it can output a right result ,when we input 
a test example. 

(2) Test the accuracy of the best results in the given 
data set. 

(3) Detect how much knowledge in the neural network 
has not been extracted. 

(4) Detect the inconsistency between the extracted rides 
and the trained neural network. 

4. Fuzzy neural network based DM 

The types of data mining based on neural network are 
hundreds, but there are only two types most used which 
are the data mining based on the self-organization neural 
network and on the fuzzy neural network. As an example, 
we discuss the data mining based on fuzzy neural network. 

4.1 The structure of the fuzzy neural network 


We can set up a fuzzy neural network model which has 
five layers 1191 , seen as in Figure 3. 



Figure 3. - Three Layer Neural Network 

In this model, the first layer is input layer; the second 
compute affiliation relation of the input data; the third is 
data preprocessing layer; the fourth is rules option, and 
the fifth is compute the result and output it. 

4.2. The standard of the sample data 

There is much of the history data in every corporation. 
We must choice some useful data and extract the 
attribute of the data, and then prepare all the data 
according to the data preparation stage. Suppose there 
are n samples data, they are x,,x 2 ,...,x k , and their 
corresponding history result data. We use all the sample 
data as training example and get the original affiliation 
relation, weights. 

4.3. Rules option and adjusting 

After training the samples and their expected 
membership corresponding to various types in learning 
stage fuzzy network will have the ability to reflect the 
affiliation relation between the input and output in 
training set, and can give the membership of the 
recognition pattern in data mining. Because of many cases 
have not a certain conclude, we must use much of the 
expert knowledge to enhance quality of the data 
preprocessing and the rules option. According to the 
result, adjust the affiliation relations, the weights and the 
rules again and again, then we can get the good result. 

4.4 Effective combination of knowledge 
processing and neural computation 

It may be easy to found a model, but how can we 
evaluation it? Evaluating whether a data mining 
implementation algorithm is fine the following indicators 
and characteristics can be used: (1) whether the data 
mining implement algorithm can adapt the complex 
circumstances. Especially, whether high-quality modeling 
under the circumstances of noise and data half-baked; (2) 
whether the model is easy to understand by a special user. 
Especially, the model must be understood by users and 
can be used for decision-making; (3) whether the model 
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has a good interactive characteristic. In order to improve 
the modeling quality, the model must receive area 
knowledge, user’s rules enter and weight adjust. As a user, 
it is not enough to depend on the neural network model 
providing results, before important decision-making users 
need to understand the rationale and justification for the 
decision-making. Therefore, in the ANN data mining 
knowledge base(common sense, rules , test data) should 
be established in order to accede domain knowledge and 
the knowledge ANN learning to the system in the data 
mining process. 

4.5 Input/output interface 

Input/output interface is very important to a user. A 
good interface with relational database, multi-dimensional 
database and data warehouse should be established to 
meet the needs of data mining. 

5. Conclusion 

Data Mining has a lot to gain from neural networks. 
Their ability to learn makes them very flexible and 
powerful. Furthermore there is no need to devise an 
algorithm in order to perform a specific task; i.e. there is 
no need to understand the internal mechanisms of that 
task. They are also very well suited for real time systems 
because of their fast response and computational times 
which are due to their parallel architecture. 

The neural net supplies answers, but not 
explanations. Indeed, the neural model embodies 
correlations (like intuitive associations), not causal 
relations (explanations). Examining the neural network 
itself only shows us meaningless numeric values. The 
neural model is a ‘black box’. On the other hand, this 
model being continuous and derivable, one can, beyond 
simple interrogation, ‘explore’ it to determine typical 
profiles, the degree of explicative power of each variable, 
reclassify collections of examples to determine their 
associated probabilities, visualize data and predictions, 
very predictive: the models are very faithful to reality. 

Perhaps the most exciting aspect of neural networks 
is the possibility that some day 'consious' networks might 
be produced. There is a number of scientists arguing that 
conciousness is a 'mechanical' property and that 
'consious' neural networks are a realistic possibility. 

Neural networks have a huge potential and their 
integration with artificial intelligence, fuzzy logic and 
other related subjects will increase their future application 
to Data mining. 
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